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Isolated photoprotein mtClytin, and the use thereof 



The invention relates to the photoprotein mtClytin, to its nucleotide and amino acid sequences and 
to the activity and use of the photoprotein mtClytin. 

Photoproteins 

The phenomenon of the generation of light by living organisms is designated bioluminescence. It is 
the result of biochemical reactions in cells, in which reactions the chemical energy is emitted in the 
form of light quanta (what is termed cold emission by means of chemoluminescence). While the 
light which is produced in this way is monochromatic, since it is emitted in connection with a 
discrete electron transfer, it can be shifted by secondary luminescent dyes (e.g. fluorescent proteins 
in the case of luminescent jellyfish of the genus Aequoria) into spectral regions of longer 
wavelength. 

Bioluminescence has a diversity of biological functions: at an ocean depth of between 200 and 
1000 m (mesopelagial), about 90% of all living organisms luminesce. In this case, the luminescent 
signals are employed for attracting partners, for deception and as a lure. Glowworms and fireflies 
also use the light signals for seeking partners. On the other hand, the significance of the 
luminescence of bacteria, fungi and single-cell algae is unclear. It is assumed that it is used for 
coordinating many single individuals in a large population or else represents a type of biological 
clock. 

A large number of coelenterates are bioluminescent (Morin et al., 1974). These organisms emit 
blue or green light. As an isolated protein, aequorin, which is derived from Aequoria victoria 
(Shimomura et al., 1969) and which, in 1962, was the first light-producing protein to be identified, 
emitted a blue light, and not a green light as observed phenotypically in the case of Aequoria 
victoria. The green fluorescent protein (GFP) which, as a result of being activated by aequorin, 
causes Aequoria victoria to appear phenotypically green was subsequently isolated from this 
medusa (Johnson et al., 1962; Hastings et al., 1969; Inouye et al., 1994). Other photoproteins 
which have also been identified and described are clytin (Inouye et al., 1993), mitrocomin (Fagan 
et al., 1993) and obelin (Illarionov et al., 1995). 
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Table 1: Overview of some photoproteins. The table gives the name, the organism from 

which the protein has been isolated and the identification number (Acc. No.) of the 
database entry. 



i. 1 H1UV 


flr*CF£inicm 




Obelin 


Obelia geniculata 


AAL86372 


Clytin 


Clytia gregaria 


CAA49754 


Aequorin 


Aequorea macrodactyla 


AAK0206 1 


Aequorin 


Aequorea parva 


AAK02060 


Mitrocomin 


Mitrocoma cellularia 


AAA29298 


Pholasin 


Pholas dactylus 


AAM 18085 


? 


Symplectoteuthis oualaniensis 


AX305029 



Table 2: Overview of some photoproteins. The table gives the organism from which the 

protein has been isolated, the name of the photoprotein and a selection of patents 
or applications. 



Organism 


Fluorescent protein 


Patent/Application 


Obelia geniculata 


Obelin 


WO03006497 


Clytia gregaria 


Clytin 


WO03006497 


Aequoria victoria 


Aequorin 


WO200168824 
US-0908909 
US 6,152,358 
JP-0176125 


Pholas dactylus 


Pholasin 


WO0028025 
GB-0024357 



Bioluminescence is nowadays used in technology in a wide variety of ways, e.g. in the form of 
bioindicators of environmental pollution or in biochemistry for sensitively detecting proteins or for 
quantifying particular compounds, or as what are termed reporters in connection with investigating 
gene regulation in the cell. 

The photoproteins differ not only in their nucleotide and amino acid sequences but also in their 
biochemical and physical properties. 

It has been demonstrated that the physical and biochemical properties of photoproteins can be 
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altered by altering the amino acid sequences of these proteins. Examples of mutagenized 
photoproteins are described in the literature (US 6,495,355; US 5,541,309; US 5,093,240; 
Shimomura et ah, 1986). 

The abovementioned photoproteins generate light by oxidizing coelenterazine (Haddock et aL, 
2001; Jones et aL, 1999). 

Reporter systems 

In general, genes whose gene products can be readily detected using simple biochemical or 
histochemical methods are termed reporter genes or indicator genes. At least 2 types of reporter 
gene are distinguished. 

1. Resistance genes. This is the term used for genes whose expression confers, on a cell, 
resistance to antibiotics or other substances whose presence in the growth medium leads to 
the death of the cell if the resistance gene is absent. 

2. Reporter genes. The products of reporter genes are used in genetic manipulation as fused 
or unfused indicators. The commonest reporter genes include beta-galactosidase (Alam et 
aL, 1990), alkaline phosphatase (Yang et aL, 1997; Cullen et aL, 1992), and luciferases and 
other photoproteins (Shinomura, 1985; Phillips GN, 1997; Snowdowne et aL, 1984). 

The emission of photons in the visible spectral range, with this emission being effected by means 
of excited emitter molecules, is termed luminescence. In contrast to fluorescence, the energy is not, 
in this case, supplied from the exterior in the form of radiation of shorter wavelength. 

A distinction is made between chemoluminescence and bioluminescence. A chemical reaction 
which leads to an excited molecule which itself luminesces when the excited electrons return to the 
basal state is termed chemoluminescence. If this reaction is catalyzed by an enzyme, the 
phenomenon is then referred to as being bioluminescence. The enzymes involved in the reaction 
are generally termed luciferases. 

Classification of the species Clytia gregaria 

Cnidaria — > Leptomedusae — > Campanulariidae — » Clytia gregaria 

The species Clytia gregaria belongs to the Cnidaria, specifically to the Medusae. The 
bioluminescent and fluorescent phenotype, respectively, has already been described in 1998 (Ward 
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et al., 1998). 
Isolating the cDNA 

In order to investigate the bioluminescence activity of the species Clytia gregaria, specimens were 
caught in the White Sea (Kartesh Biological Station, Russia) and stored in liquid nitrogen. In order 
to construct the Clytia gregaria cDNA libraries, the poly(a)+ RNA was isolated using the "Straight 
A" isolation method from Novagen (USA). 

An RT-PCR was carried out for preparing the cDNA. For this, 1 fig of RNA was incubated with 
reverse transcriptase (Superscript Gold II) in accordance with the following scheme: 



1. 30 


seconds 


95°C 


2. 6 


minutes 


68°C 


3. 10 


seconds 


95 °C 


4. 6 


minutes 


68°C 


17 cycles of step 4 after step 3 





The reaction products were incubated with proteinase K, at 37°C for 30 minutes, in order to 
inactivate the polymerase, and the cDNA was precipitated with ethanol. The cDNA expression 
library was constructed using the Clontech (USA) "SMART cDNA" library construction kit in 
accordance with the manufacturer's instruction. The cDNA was cloned into the expression vector 
pTriplEx2 (Clontech; USA). The expression vectors were transformed by electroporation into 
bacteria of the strain E. coli XL1 blue. 

The bacteria were plated out on solid LB nutrient medium and incubated at 37°C for 24 hours. A 
replica plating was then carried out, with the bacteria being transferred to another solid nutrient 
medium plate using a nitrocellulose filter. The replica plate was in turn incubated at 37°C for 24 
hours and the bacterial colonies which had grown were transferred into liquid LB medium. After 
IPTG (final concentration, 0.1 mM) had been added, the bacteria were incubated at 37°C for 4 
hours on a shaker. The bacteria were harvested by centrifugation and the bacterial mass was 
resuspended, at 0°C, in 0.5 ml of disruption buffer (5 mM EDTA, 20 mM Tris-HCL, pH 9.0). The 
bacteria were then disrupted by ultrasonication. 

After adding coelenterazine (final concentration, 10E-07 M), the lysates were incubated at 4°C for 
3 hours. The bioluminescence was then measured in a luminometer after adding calcium chloride 
(final concentration, 20 mM). 



A photoprotein was identified. The photoprotein was designated mtClytin. The photoprotein 
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mtClytin is described in detail below. 
mtClytin 

With an identity of 87%, the photoprotein mtClytin exhibits the highest homology at the amino 
acid level with clytin from Clytia gregaria and an identity of 77% with obelin from Obelia 
5 geniculata (shown in Example 8; Figure 8). The homology of 87% - in relation to clytin - occurs at 
the C-terminal end of the protein, multiple amino acid substitutions being identifiable distributed 
over the entire protein. At the nucleic acid level, the identity is less than 30% (shown in 
Example 7; Figure 7). The BLAST method (Altschul et al., 1997) was used for the sequence 
comparison. 

10 The photoprotein clytin-2 exhibits the highest homology at the amino acid level with clytin from 
Clytia gregaria. However, the sequence exhibits a number of differences in the amino acid 
sequence, with these differences being depicted in Example 1 1 (Figure 9). These differences can 
lead to changes in physicochemical, biochemical and bioluminescent properties. The photoprotein 
clytin-2 does not possess any signal peptide (as shown in Example 10). 

15 The photoprotein mtClytin possesses a signal peptide which can lead to the photoprotein being 
translocated into mitochondria. The signal peptide was identified by the computer program 
MITOPROT (Claros et al., 1996) (shown in Example 10). The signal peptide which was 
determined by MITOPROT is given in SEQ ID NO: 3. The photoprotein mtClytin is the first 
photoprotein in which a natural signal peptide for translocation into mitochondria has been 

20 identified. 

The invention also relates to functional equivalents of mtClytin. Functional equivalents are those 
proteins which have comparable physicochemical properties and are at least 70% homologous with 
SEQ ID NO: 2. Preference is given to a homology of at least 80% or 90%. A homology of at least 
95% is particularly preferred. 

25 The invention also relates to the functional equivalents of the mtClytin signal peptide. Functional 
equivalents are those proteins or peptides which have comparable physicochemical properties and 
are at least 70% homologous to SEQ ID NO: 3. Preference is given to a homology of at least 80% 
or 90%. A homology of at least 95% is particularly preferred. 

The photoprotein mtClytin is suitable for being used as a reporter gene for cellular systems, 
30 especially for receptors, for ion channels, for transporters, for transcription factors or for inducible 
systems. 
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The mtClytin signal peptide is also suitable for being fused to reporter genes in order to be used as 
a fused reporter gene for cellular systems, especially for receptors, for ion channels, for 
transporters, for transcription factors or for inducible systems. 

The photoprotein mtClytin is also suitable for being used as a reporter gene by labeling, 
identifying and characterizing cell organelles, especially for mitochondria. 

The mtClytin signal peptide is also suitable for being fused to peptides or proteins for translocation 
into cell organelles, especially mitochondria. 

The photoprotein mtClytin is also suitable for being used as a reporter gene for determining 
parameters inside and outside cell organelles, especially mitochondria, especially calcium 
concentrations. 

The mtClytin signal peptide is also suitable, as a fusion peptide, for being used as a reporter gene 
for determining parameters inside and outside cell organelles, especially mitochondria, especially 
calcium concentrations. 

The photoprotein mtClytin is suitable for being used as a reporter gene in bacterial and eukaryotic 
systems, especially in mammalian cells, in bacteria, in yeasts, in baculo and in plants. 

The photoprotein mtClytin is suitable for being used as a reporter gene for cellular systems in 
combination with bioluminescent or chemoluminescent systems, especially systems using 
luciferases, using oxygenases or using phosphatases. 

The mtClytin signal peptide is also suitable, as fusion peptide, for being used as a reporter gene for 
cellular systems in combination with bioluminescent or chemoluminescent systems, especially 
systems using luciferases, using oxygenases or using phosphatases. 

The photoprotein mtClytin is suitable for being used as a fusion protein, especially for receptors, 
for ion channels, for transporters, for transcription factors, for proteinases, for kinases, for 
phosphodiesterases, for hydrolases, for peptidases, for transferases, for membrane proteins and for 
glycoproteins. 

The mtClytin signal peptide is also suitable, as fusion peptide, for being used as a fusion protein, 
especially for receptors, for ion channels, for transporters, for transcription factors, for proteinases, 
for kinases, for phosphodiesterases, for hydrolases, for peptidases, for transferases, for membrane 
proteins and for glycoproteins. 

The photoprotein mtClytin is suitable for being immobilized, especially by antibodies, by biotin, or 
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by magnetic or magnetizable supports. 

The photoprotein mtCIytin is suitable for being used as a protein for energy transfer systems, 
especially FRET (fluorescence resonance energy transfer), BRET (bioluminescence resonance 
energy transfer), FET (field effect transistors), FP (fluorescence polarization) and HTRF 
(homogeneous time-resolved fluorescence) systems. 

The photoprotein mtCIytin is suitable for labeling substrates or ligands, especially for proteases, 
for kinases or for transferases. 

The photoprotein mtCIytin is suitable for being expressed in bacterial systems, especially for titer 
determination, as a substrate for biochemical systems, especially for proteinases and kinases. 

The photoprotein mtCIytin is suitable for being used as a label, especially coupled to antibiotics, 
coupled to enzymes, coupled to receptors or coupled to ion channels and other proteins. 

The mtCIytin signal peptide is also suitable, as fusion peptide, for being used as a label, especially 
coupled to antibiotics, coupled to enzymes, coupled to receptors or coupled to ion channels and 
other proteins. 

The photoprotein mtCIytin is suitable for being used as a reporter gene in the search for 
pharmacological active compounds, especially in HTS (high throughput screening). 

The mtCIytin signal peptide is also suitable for being used as a reporter gene in the search for 
pharmacological active compounds, especially in HTS (high throughput screening). 

The photoprotein mtCIytin is suitable for being used as a component of detection systems, 
especially for ELISA (enzyme-linked immunosorbent assay), for immunohistochemistry, for 
Western blotting or for confocal microscopy. 

The photoprotein mtCIytin is suitable for being used as a label for analyzing interactions, 
especially for protein-protein interactions, for DNA-protein interactions, for DNA-RNA 
interactions, for RNA-RNA interactions, or for RNA-protein interactions (DNA: 
desoxyribonucleic acid; RNA: ribonucleic acid). 

The photoprotein mtCIytin is suitable for being used as a label or fusion protein for expression in 
transgenic organisms, especially in mice, in rats, in hamsters and other mammals, in primates, in 
fish, in worms or in plants. 

The mtCIytin signal peptide is also suitable, as fusion peptide, for being used as a label or fusion 



Le A 36 839- Foreign Countries 



-8- 

protein for expression in transgenic organisms, especially in mice, in rats, in hamsters and other 
mammals, in primates, in fish, in worms or in plants. 

The photoprotein mtClytin is suitable for being used as a label or fusion protein for analyzing 
embryonic development. 

The photoprotein mtClytin is suitable for being used as a label by way of a coupling mediator, 
especially by way of biotin, by way of NHS (N-hydroxysulfosuccimide) or by way of CN-Br. 

The photoprotein mtClytin is suitable for being used as a reporter which is coupled to nucleic 
acids, especially to DNA or RNA. 

The photoprotein mtClytin is suitable for being used as a reporter which is coupled to proteins or 
peptides. 

The mtClytin signal peptide is also suitable, as fusion peptide, for being used as a reporter which is 
coupled to proteins or peptides. 

The photoprotein mtClytin is suitable for being used as a reporter for measuring intracellular or 
extracellular calcium concentrations. 

The photoprotein mtClytin is suitable for characterizing signal cascades in cellular systems. 

The photoprotein mtClytin which is coupled to nucleic acids or peptides is suitable for being used 
as a probe, especially for Northern blots, for Southern blots, for Western blots, for ELISA, for 
nucleic acid sequencings, for protein analyses or for chip analyses. 

The photoprotein mtClytin is suitable for being used for labeling pharmacological formulations, 
especially infectious agents, antibodies or "small molecules". 

The photoprotein mtClytin is suitable for being used for geological investigations, especially for 
ocean, groundwater and river currents. 

The photoprotein mtClytin is suitable for being expressed in expression systems, especially in 
in-vitro translation systems, in bacterial systems, in yeast systems, in baculo systems, in viral 
systems and in eukaryotic systems. 

The mtClytin signal peptide is also suitable, as fusion peptide, for being expressed in expression 
systems, especially in in-vitro translation systems, in bacterial systems, in yeast systems, in baculo 
systems, in viral systems and in eukaryotic systems. 
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The photoprotein mtClytin is suitable for visualizing tissues or cells in connection with surgical 
interventions, especially in connection with invasive, in connection with noninvasive and in 
connection with minimally invasive interventions. 

The photoprotein mtClytin is also suitable for labeling tumor tissues and other phenotypically 
altered tissues, especially in connection with histological investigation and in connection with 
surgical interventions. 

The invention also relates to the purification of the photoprotein mtClytin, especially as a wild- 
type protein, as a fusion protein and as a mutagenized protein. 

The invention also relates to the purification of the mtClytin signal peptide, especially as a wild- 
type protein, as a fusion protein and as a mutagenized protein. 

The invention also relates to the use of the photoprotein mtClytin in the field of cosmetics, 
especially bath additives, lotions, soaps, body dyes, toothpaste and body powders. 

The invention also relates to the use of the photoprotein mtClytin for dyeing, especially dyeing 
foodstuffs, bath additives, ink, textiles and plastics. 

The invention also relates to the use of the photoprotein mtClytin for dyeing paper, especially 
greetings cards, paper products, wallpapers and handicraft articles. 

The invention also relates to the use of the photoprotein mtClytin for dyeing liquids, especially for 
water pistols, fountains, beverages and ice. 

The invention also relates to the use of the photoprotein mtClytin for producing toys, especially 
finger dye and makeup. 

The invention relates to nucleic acid molecules which encode the polypeptide which is disclosed 
by SEQ ID NO: 2. 

The invention relates to nucleic acid molecules which encode the polypeptide which is disclosed 
by SEQ ID NO: 3. 

The invention relates to nucleic acid molecules which encode the polypeptide which is disclosed 
by SEQ ID NO: 6. 

The invention relates to the polypeptide having the amino acid sequence which is disclosed in SEQ 
ID NO: 2. 
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The invention relates to the polypeptide having the amino acid sequence which is disclosed in SEQ 
ID NO: 3. 

The invention relates to the polypeptide having the amino acid sequence which is disclosed in 
SEQ ID NO: 6. 

The invention furthermore relates to nucleic acid molecules which are selected from the group 
consisting of 

a) nucleic acid molecules which encode a polypeptide which contains the amino acid 
sequence disclosed by SEQ ID NO: 2; 

b) nucleic acid molecules which contain the sequence depicted by SEQ ID NO: 1; 

c) nucleic acid molecules whose complementary strand hybridizes with a nucleic acid 
molecule from a) or b) under stringent conditions and which encode a polypeptide which 
exhibits the biological function of a photoprotein; 

A stringent hybridization of nucleic acid molecules can be carried out, for example, in an 
aqueous solution comprising 0.2 x SSC (Ix standard saline citrate = 150 mM NaCl, 
15 mM trisodium citrate) at 68°C (Sambrook et aL, 1989). 

d) nucleic acid molecules which differ from the nucleic acid molecules mentioned under c) 
due to the degeneracy of the genetic code; 

e) nucleic acid molecules which exhibit a sequence homology with SEQ ID NO: 1 of at least 
95% and whose protein product exhibits the biological function of a photoprotein; and 

0 nucleic acid molecules which exhibit a sequence homology with SEQ ID NO: 1 of at least 
65% and whose protein product exhibits the biological function of a photoprotein. 

The invention also relates to nucleic acid molecules which exhibit a sequence homology with SEQ 
ID NO: 1 or SEQ ID NO: 5 of at least 95%, 90%, 85%, 80%, 75%, 70%, 65% or 60% and which 
encode a polypeptide which possesses the properties of a photoprotein. 

The invention also relates to nucleic acid molecules which exhibit a sequence homology with 
SEQ ID NO: 4 of at least 95%, 90%, 85%, 80%, 75%, 70%, 65% or 60% and which encode a 
polypeptide which possesses the properties of a signal or leader peptide. 

The invention relates to the abovementioned nucleic acid molecules in which the sequence 
contains a functional promoter 5' to the photoprotein-encoding sequence or to the leader- or 
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signal-sequence-encoding sequence. 

The invention also relates to nucleic acid molecules as previously described which are constituents 
of recombinant DNA or RNA vectors. 

The invention relates to organisms which harbor such a vector. 

The invention relates to oligonucleotides having more than 10 consecutive nucleotides which are 
identical or complementary to the DNA or RNA sequence of the mtClytin molecules or of the 
other molecules according to the invention. 

The invention relates to photoproteins which are encoded by the previously described nucleotide 
sequences. 

The invention relates to methods for expressing the photoprotein polypeptides according to the 
invention in bacteria, in eukaryotic cells or in in-vitro expression systems. 

The invention also relates to methods for purifying/isolating a photoprotein polypeptide according 
to the invention. 

The invention relates to peptides which have more than 5 consecutive amino acids and which are 
immunologically recognized by antibodies directed against the photoproteins according to the 
invention. 

The invention relates to the use of the photoprotein-encoding nucleic acids according to the 
invention as marker genes or reporter genes, in particular for searching for pharmacological active 
compounds and for diagnostics. 

The invention relates to the use of the photoproteins according to the invention or of a 
photoprotein-encoding nucleic acid according to the invention as labels or reporters or as a marker 
gene or reporter gene. 

The invention relates to the use of the photoprotein mtClytin (SEQ ED NO; 2), or to the use of a 
nucleic acid which encodes the photoprotein mtClytin as a label or reporter, or as a label or 
reporter gene, in particular for searching for pharmacological active compounds and for 
diagnostics. 

The invention relates to the use of the nucleic acid depicted in SEQ ID NO: 1 as a marker gene or 
reporter gene, in particular for searching for pharmacological active compounds and diagnostics. 

The invention relates to the use of the peptide depicted in SEQ ID NO: 6 and its underlying nucleic 
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acid sequence SEQ ID NO: 5 as a marker gene or reporter gene, in particular for searching for 
pharmacological active compounds and diagnostics. 

The invention also relates to polyclonal or monoclonal antibodies which recognize a polypeptide 
according to the invention. 

The invention also relates to monoclonal or polyclonal antibodies which recognize the 
photoprotein mtClytin (SEQ ID NO: 2) or the photoprotein clytin-2 (SEQ ED NO: 6). 

The invention also relates to monoclonal or polyclonal antibodies which recognize the signal 
peptide of the photoprotein mtClytin (SEQ ID NO: 3). 

The invention furthermore relates to a nucleic acid molecule which is selected from the group 
consisting of 

a) nucleic acid molecules which encode a polypeptide which contains the amino acid 
sequence disclosed by SEQ ID NO: 3; 

b) nucleic acid molecules which contain the sequence depicted by SEQ ID NO: 4; 

c) nucleic acid molecules whose complementary strand hybridizes with a nucleic acid 
molecule from a) or b) under stringent conditions and which encode a peptide which 
exhibits the biological function of a signal or leader peptide; 

d) nucleic acid molecules which differ from the nucleic acid molecules mentioned under c) 
due to the degeneracy of the genetic code; 

e) nucleic acid molecules which exhibit a sequence homology with SEQ ED NO: 4 of at least 
95% and encode a peptide which has the biological function of a signal or leader peptide; 
and 

f) nucleic acid molecules which exhibit a sequence homology with SEQ ID NO: 4 of at least 
65% and encode a peptide which has the biological function of a signal or leader peptide. 

The invention likewise relates to a nucleic acid molecule which is selected from the group 
consisting of 

a) nucleic acid molecules which encode a polypeptide which contains the amino acid 
sequence disclosed by SEQ ID NO: 6; 

b) nucleic acid molecules which contain the sequence depicted by SEQ ID NO: 5; 
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c) nucleic acid molecules whose complementary strand hybridizes with a nucleic acid 
molecule from a) or b) under stringent conditions and which encode a polypeptide which 
exhibits the biological function of a photoprotein; 

d) nucleic acid molecules which differ from the nucleic acid molecules mentioned under c) 
due to the degeneracy of the genetic code; 

e) nucleic acid molecules which exhibit a sequence homology with SEQ ID NO: 5 of at least 
95% and encode a polypeptide which has the biological function of a photoprotein; and 

f) nucleic acid molecules which exhibit a sequence homology with SEQ ED NO: 5 of at least 
80% and encode a polypeptide which has the biological function of a photoprotein. 

The invention also relates to a nucleic acid which is as described in the preceding paragraphs and 
which contains a functional promoter 5" to the coding sequence. 

The invention includes recombinant DNA or RNA vectors which contain the previously described 
nucleic acids. 

Organisms which harbor a vector as previously described are likewise in accordance with the 
invention. 

The invention also relates to oligonucleotides having more than 10 consecutive nucleotides which 
are identical or complementary to a constituent sequence of a nucleic acid molecule as described 
above. 

The invention also relates to a polypeptide which is encoded by a nucleic acid sequence as 
described above. 

The invention also relates to a method for expressing the abovementioned polypeptides in bacteria, 
viral cells, yeasts or eukaryotic cells or in in-vitro expression systems. 

The invention likewise relates to a method for purifying/isolating a polypeptide according to the 
invention. 

The invention likewise relates to peptides having more than 5 consecutive amino acids which are 
recognized immunologically by antibodies directed against the photoprotein mtClytin. 

The invention furthermore relates to peptides having more than 5 consecutive amino acids which 
are recognized immunologically by antibodies directed against the photoprotein clytin-2. 
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The invention also relates to peptides having more than 5 consecutive amino acids which are 
recognized immunologically by antibodies directed against the signal or leader peptide disclosed 
by SEQ ID NO: 3. 

The invention also relates to peptides having more than 5 consecutive amino acids which are 
recognized immunologically by antibodies directed against the photoprotein disclosed by SEQ ID 
NO: 6 (clytin-2). 

The invention relates to the use of a nucleic acid according to the invention as a marker gene or 
reporter gene. 

The invention also relates to the use of a photoprotein according to the invention as a label or 
reporter. 

The invention furthermore relates to the use, as signal or leader sequence, of a nucleic acid which 
contains the sequence depicted as SEQ ID NO: 4 or a sequence having 60%, 65%, 70%, 75%, 
80%, 85% or 90%, preferably having 95%, sequence identity with SEQ ID NO: 4. 

The invention also relates to the use, as signal or leader peptide, of a peptide which contains the 
sequence depicted as SEQ ID NO: 3 or a sequence having 60%, 65%, 70%, 75%, 80%, 85% or 
90%, preferably having 95%, sequence identity with SEQ ID NO: 3. 

The invention likewise relates to the use, which is described in the two preceding paragraphs, for 
transporting proteins which are fused to the signal or leader peptide into cell organelles. 

The invention also relates to the use which is described in the preceding paragraph, with the cell 
organelles being mitochondria. 

The invention also relates to the use which is described in the preceding paragraph, with the cell 
organelles being the endoplasmic reticulum (ER). 

The invention furthermore relates to the use of the nucleic acid sequence which is depicted as SEQ 
ID NO: 4 as a signal or leader sequence. 

The invention also relates to the use of the peptide which is depicted as SEQ ID NO: 3 and which 
contains the depicted sequence as a signal or leader peptide. 

The invention likewise relates to the use which is described in the two preceding paragraphs for 
transporting a protein which is fused to the signal or leader peptide into cell organelles. 

The invention also relates to the use which is described in the preceding paragraph, with the cell 
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organelles being mitochondria. 

The invention also relates to the use which is described in the preceding paragraph, with the cell 
organelles being the endoplasmic reticulum (ER). 

The invention likewise relates to the use of the polypeptides according to the invention as reporter 
5 proteins in searching for pharmacological active compounds. 

Finally, the invention also relates to the use of the nucleic acids according to the invention as 
reporter genes in searching for pharmacological active compounds. 

Expressing the photoproteins of the invention 

10 The production of a molecule which, after the gene has been introduced into a suitable host cell, 
enables the foreign gene which is cloned into an expression vector to be transcribed and translated 
is termed expression. Expression vectors contain the control signals which are required for 
expressing genes in prokaryotic or eukaryotic cells. 

15 In principle, expression vectors can be constructed in two different ways. In the case of what are 
termed transcription fusions, the protein encoded by the cloned-in foreign gene is synthesized as an 
authentic, biologically active protein. For this purpose, the expression vector carries all the 5' and 
3' control signals which are required for the expression. 

20 In the case of what are termed translation fusions, the protein encoded by the cloned-in foreign 
gene is expressed, together with another protein which can be detected readily, as a hybrid protein. 
The 5' and 3' control signals which are required for the expression, including the start codon and, 
possibly, a part of the sequences encoding the N-terminal regions of the hybrid protein to be 
formed, originate from the vector. The additional inserted protein moiety not only in many cases 

25 stabilizes the protein, which is encoded by the cloned-in foreign gene, against breakdown by 
cellular proteases; it can also be used for detecting and isolating the hybrid protein which is 
formed. The expression can take place either transiently or stably. Suitable host organisms are 
bacteria, yeasts, viruses or eukaryotic systems. 
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Purifying the photoproteins of the invention 

The isolation of proteins (after they have been overexpressed as well) is frequently termed protein 
purification. A large number of established methods are available for purifying proteins. 

5 

The solid/liquid separation is a basic operation in connection with isolating proteins. This 
procedural step is required when separating cells from the culture medium, when clarifying the 
crude extract after having disrupted the cells and removing the cell debris, and when separating off 
sediments after precipitations, etc. It takes place by means of centrifugation and filtration. 

10 

In order to obtain intracellular proteins, the cell wall must be destroyed or rendered permeable. 
High-pressure homogenizers or agitator ball mills or glass bead mills are used for this purpose, 
depending on the scale and the organism. Mechanical cell integrations and ultrasonic treatment are 
used, inter alia, on the laboratory scale. 

15 

Both in the case of extracellular proteins and in the case of intracellular proteins (following cell 
disruption), various precipitation methods using salts (in particular ammonium sulfate) or organic 
solvents (alcohols or acetone) represent rapid and efficient methods for concentrating proteins. 
When intracellular proteins are being purified, it is desirable to remove the soluble nucleic acids 
20 (precipitation with, for example, streptomycin sulfate or protamine sulfate). When extracellular 
proteins are being isolated, carriers (e.g. starch or kieselguhr) are frequently added before adding 
the precipitating agents in order to obtain sediments which are easier to handle. 

Numerous chromatographic methods and partition methods (absorption chromatography and ion 
25 exchange chromatography, gel filtration, affinity chromatography and electrophoreses) are 
available for high-degree purification. Column chromatography is also used on an industrial scale. 
Affinity chromatography, which makes possible purification factors of up to several 100s per step, 
is especially important for the laboratory scale. 

30 Extracellular proteins accrue in relatively dilute solutions. Just like extracellular proteins, they 
have to be concentrated before being subjected to further use. In addition to the methods which 
have already been mentioned, ultrafiltration has proved to be of value, on an industrial scale as 
well. 

35 Inorganic salts which accompany proteins are frequently undesirable in the case of specific 
applications. They can be removed by, inter alia, gel filtration, dialysis and diafiltration. 
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A large number of proteins are used as dry preparations. Important drying methods are vacuum 
drying, freeze drying and spray drying. 

5 Nucleotide and amino acid sequences 

The photoprotein mtClytin is encoded by the following nucleotide sequence (SEQ ID NO: 1): 

gacagataaaaaattcactccttagattatttagtgaataagagaaaaaaaggataagaaatcaag 
atgcaaaggtttacaaatcgtcttctttccatgtcggctttacgtgcaagatcaagattgcaacgc 

10 acggcaaattttcacaccagcatactcttggctacagattcaaaatacgcggtcaaactcgatcct 
gattttgcaaatccaaaatggatcaacagacacaaatttatgttcaactttttggacataaacggt 
aaggggaaaatcacattagatgaaatcgtctccaaagcttcagacgacatttgtgctaaactggat 
gcaacaccagaacagaccaaacgtcaccaggatgctgttgaagcctttttcaagaaaatgggcatg 
gattatggtaaagaagttgcattcccagaatttattaagggatgggaagagttggccgaacacgac 

15 ttggaactctggtctcaaaacaaaagtacattgatccgtgaatggggagatgctgttttcgacatt 
ttcgacaaagacgcaagtggctcaatcagtttagacgaatggaaggcttacggacgaatctctgga 
atctgtccatcagacgaagacgctgagaagacgttcaaacattgtgatttggacaacagtggcaaa 
cttgatgttgatgagatgaccaggcaacatttaggcttctggtacacattggatccaacttctgat 
ggtctttatggcaattttgttccctaagaagcgttcagttaaaaacgctaaacattgttcagttgt 

20 aaaattatat teat tt teat ttcgtaaaattagtatttataaatttgtatcataaattgtatccat 
gttgtagactaaataagactcggcaaaaaaaaaaaaaaaaaaaaaaaaaaaaaa -3 v . 

This yields an amino acid sequence of (SEQ ID NO: 2): 

MQRFTNRLLSMSALRARSRLQRTA]^ 
FLDINGKGKJTLDEIVS KASDDIC AKXDA 
25 FIKG\\^ELAEHDLELWSQNKSTLmEWGDAVFDIFDKDASGSISLDEWKAYGRISGICPSDE 
DAEKTFKHCDLDNSGKLDVDEMTRQHLGFWYTLDPTSDGLYGNFVP 

The putative signal peptide of the photoprotein mtClytin possesses the following sequence 
(SEQ ID NO: 3): 

MQRFTNRLLSMSALRA 

30 and has the following nucleic acid sequence: 

5 V - atgcaaaggtttacaaatcgtcttctttccatgtcggctttacgtgca - 3 V (SEQ ID NO 4) 

The photoprotein clytin-2 is encoded by the following nucleotide sequence (SEQ ID NO: 5): 
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GATCTCAGCTCAACTTGCAATAAGTATCAGATCAAATTTTGCAACTCAAAGCAAATCA 

TCAACTTCATCATAATGACTGACACTGCTTCAAAATACGCTGTCAAACTCAAGACCAA 

CTTTGAAGATCCAAAATGGGTCAACAGACACAAATTTATGTTCAACTTTTTGGACATT 

AACGGCAACGGAAAAATCACTTTGGATGAAATTGTCTCCAAAGCTTCGGATGACATTT 

GCGCCAAACTTGGAGCTACACCAGCTCAAACCCAACGTCATCAGGAAGCTGTTGAAGC 

TTTCTTCAAGAAGATTGGTTTGGATTATGGCAAAGAAGTCGAATTCCCAGCTTTCGTTA 

ACGGATGGAAAGAACTGGCCAAACATGACTTGAAACTTTGGTCCCAAAACAAGAAAT 

CTTTGATCCGCAATTGGGGAGAAGCTGTATTCGACATTTTCGACAAGGACGGAAGTGG 

CTCAATCAGTTTGGACGAATGGAAAACATACGGAGGAATCTCTGGAATCTGTCCATCA 

GACGAAGACGCTGAAAAGACCTTCAAACATTGCGATTTGGACAACAGTGGCAAACTT 

GATGTTGACGAGATGACCAGACAACATTTGGGATTCTGGTACACCTTGGACCCTAACG 

CTGATGGTCTTTATGGCAACTTTGTCCCTTAAAA^ 

GGTTATTTTTTCATAATTGTCATTTGATTTTAACTTTGTTTCGGAAAAT^ 
CTTTATTCAGAAAAAAAAAAAAAAAAAAAAAAAAA - 3 V 

This yields an amino acid sequence of (SEQ ID NO: 6): 

MTDTASKYAVKLKTKreDPKWVNRHKPMFNFLDEN 
PAQTQRHQEAVEAJTKKIGLDYGKEVE 

AVFDIFDKDGSGSISLDEWKTYGGISGICPSDEDAEKTFKHCDLDNSGKLDVDEMTRQHLG 
FWYTLDPNADGLYGNFVP 

These sequences are reproduced in the sequence listing. 
Brief description of the Figures 

Figure 1: Figure 1 shows the plasmid map of the vector pTriplEX2-mtClytin. 

Figure 2: Figure 2 shows the plasmid map of the vector pcDNA3-mtClytin. 

Figure 3: Figure 3 shows the result of the bacterial expression of mtClytin and the 

bioluminescence activity of mtClytin following bacterial expression. (Y = RLU: 
relative light units; X = dilution; black bar = mtClytin; gray bar = control lysate). 

Figure 4: Figure 4 shows the result of the eukaryotic expression of mtClytin and the 

bioluminescence activity of mtClytin following expression in CHO cells. (Y = 
RLU: relative light units; X = ATP (logarithmic representation in mol/1)). 



Figure 5: 



Figure 5 shows the kinetic analysis of the bioluminescence of mtClytin. (Y = RLU: 
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relative light units; X = time [seconds]). 

Figure 6: Figure 6 shows the kinetic analysis of the bioluminescence of obelin. (Y = RLU: 

relative light units; X = time [seconds]). 

Figure 7: Figure 7 shows the alignment of clytin and mtClytin at the amino acid level. 

5 Figure 8: Figure 8 shows the alignment of clytin and mtClytin at the nucleic acid level. 

Figure 9: Figure 9 shows the alignment of clytin, mtClytin and clytin-2 at the amino acid 

level. 
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Examples 
Example 1 

The Clontech plasmid pTriplEx2 was used as vector for preparing the construct which is described 
below. The derivative of the vector was designated pTriplEx2-mtClytin. The vector pTriplEx2- 
mtClytin was used for expressing mtClytin in bacterial systems. 

Figure 1 shows the plasmid map of the vector pTriplEX2-mtClytin. 

Example 2 

The Clontech plasmid pcDNA3.1(+) was used as the vector for preparing the construct which is 
described below. The derivative of the vector was designated pcDNA3-mtClytin. The vector 
pcDNA3-mtClytin was used for expressing mtClytin in eukaryotic systems. 

Figure 2 shows the plasmid map of the vector pcDNA3-mtClytin. 

Example 3 

Bacterial expression 

The bacterial expression was effected in the E. coli strain BL21(DE3) by transforming the bacteria 
with the expression plasmids pTriplEX2-mtClytin and pTriplEX2. The transformed bacteria were 
incubated at 37°C for 3 hours in LB medium and the expression was induced for 4 hours by adding 
IPTG up to a final concentration of 1 mM. The induced bacteria were harvested by centrifugation, 
resuspended in 50 mM Tris/HCl (pH 9.0) + 5 mM EDTA and disrupted by ultrasonication. The 
lysate was subsequently centrifuged at 13 000 rpm (16 000 ref) for 15 minutes and the supernatant 
removed. The supernatant (dilutions 1:5, 1:10; 1:20 and 1:50 with Tris/HCl pH 9.0)) was 
incubated with coelenterazine (10E-07 M coelenterazine in Tris/HCl pH 9.0) for 3 hours in the 
dark. The bioluminescence was measured in a luminometer directly after adding 5 mM calcium 
chloride. The measurement integration time was 40 seconds. 

Figure 3 shows the results of measuring the bioluminescence of mtClytin in bacteria. 
Example 4 

Eukaryotic expression 

Constitutive eukaryotic expression was effected in CHO cells by transfecting the cells with the 
expression plasmids pcDNA3-mtClytin and pcDNA3.1(+) in transient experiments. For this, 
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10 000 cells per well were plated out, in DMEM-F12 medium, on 96-well microtiter plates and the 
plates were incubated overnight at 37°C. Transfection was effected using the Fugene 6 kits 
(Roche) in accordance with the manufacturer's instructions. The transfected cells were incubated 
overnight in DMEM-F12 medium at 37°C. The medium was then removed and replaced with 50 jxl 
of coelenterazine (10E-07 M coelenterazine in PBS). The cells were incubated at 37°C for 3 hours 
and ATP (adenosine triphosphate) was then added to a final concentration of 1 nM. The 
measurement in a luminometer was started directly after the addition. The integration time was 
1 second, with the total measurement time being 60 seconds. 

Figure 4 shows the results of measuring the bioluminescence of mtClytin in CHO cells. 

Example 5 

BLAST 

Result of a BLAST analysis of mtClytin at the amino acid level: 

>emb | CAD87655 . 1 | unnamed protein product [Clytia gregaria] , Length 
= 198, Score = 368 bits (945), Expect = e-101. Identities = 
171/195 (87%), Positives = 182/195 (92%) 

>sp | Q0812 1 | CLYT_CLYGR Clytin precursor (Phialidin) , pir||S28860 
clytin - hydromedusa (Clytia gregarium) , emb | CAA49754 . 1 | clytin 
[Clytia gregaria], gb | AAA282 93 . 1 | apoclytin, Length = 198, Score = 
368 bits (945), Expect = e-101, Identities = 171/195 (87%), 
Positives = 182/195 (92%) 

>emb | CAD87 658 . 1 | unnamed protein product [synthetic construct], 
Length = 198, Score = 367 bits (943), Expect = e-101, Identities 
= 170/195 (87%), Positives = 182/195 (93%) 

>sp|Q277 09 |OBL_OBELO Obelin precursor (OBL) , pdb|lEL4|A Chain A, 
Structure Of The Calcium-Regulated Photoprotein Obelin, Determined 
By Sulfur Sas , gb | AAA67708 . 1 | unnamed protein product, Length = 
195, Score = 327 bits (837), Expect = le-88, Identities = 150/193 
(77%) , Positives = 170/193 (87%) 

>emb | CAD87674 . 1 | unnamed protein product [synthetic construct], 
Length = 195, Score = 326 bits (835), Expect = 2e-88, Identities 
= 149/193 (77%) , Positives = 170/193 (87%) 
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>emb | CAD87672 . 1 1 unnamed protein product [synthetic construct], 
Length = 195, Score = 325 bits (834), Expect = 3e-88, Identities 
= 149/193 (77%), positives = 170/193 (87%) 

>emb | CAD87673 . 1 | unnamed protein product [synthetic construct], 
Length = 195, Score = 325 bits (833), Expect = 4e-88, Identities 
= 149/193 (77%), Positives = 170/193 (87%) 

>pdb|lJF0|A Chain A, The Crystal Structure Of Obelin From Obelia 
Geniculata At 1.82 A Resolution, gb | AAL86372 . 1 | AF3 94688_1 
apoobelin [Obelia geniculata], Length = 195, Score = 325 

bits (833), Expect = 4e-88, Identities = 149/193 (77%), Positives 
= 168/193 (86%) 

Example 6 

BLAST 

Result of a BLAST analysis of mtClytin at the nucleic acid level: 

>emb|AX702125.1| Sequence 23 from Patent WO03006497, Length = 597, 
Score = 669 bits (348), Expect = 0.0, Identities = 504/582 (86%) 

>emb| AX702119 . 1 | Sequence 17 from Patent WO03006497, Length = 597, 
Score = 669 bits (348), Expect = 0.0, Identities = 504/582 (86%) 

>emb | X7 0221 . 1 | CGCLYTIN C.gregaria mRNA for clytin, Length = 747, 
Score = 669 bits (348), Expect =0.0, Identities = 504/582 (86%) 

>gb| L13247 . 1 | CY1APOCLYT Clytia gregarium apoclytin mRNA, complete 
cds, Length = 747, Score = 669 bits (348), Expect = 0.0, 
Identities = 504/582 (86%) 

>emb| AX702187 . 1 1 Sequence 85 from Patent WO03006497, Length = 597, 
Score = 664 bits (345), Expect = 0.0, Identities = 503/582 (86%) 

>emb| AX702185 . 1 | Sequence 83 from Patent WO03006497, Length = 597, 
Score = 664 bits (345), Expect = 0.0, Identities = 503/582 (86%) 

>emb| AX702183 . 1 | Sequence 81 from Patent WO03006497, Length = 597, 
Score = 664 bits (345), Expect = 0.0, Identities = 503/582 (86%) 
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>emb| AX702181 . 1 | Sequence 79 from Patent WO03006497, Length = 597, 
Score = 664 bits (345), Expect = 0.0, Identities = 503/582 (86%) 



>emb| AX702179 . 1 | Sequence 77 from 
Score = 664 bits (345) , Expect = 

>emb| AX7 02131 . 1 | Sequence 2 9 from 
Score = 664 bits (345), Expect = 



Patent WO03006497, Length = 597, 
0.0, Identities = 503/582 (86%) 

Patent WO03006497, Length = 597, 
0.0, Identities = 503/582 (86%) 



>emb| AX702129 . 1 | Sequence 27 from Patent WO03006497, Length = 597, 
Score = 664 bits (345), Expect = 0.0, Identities = 503/582 (86%) 

Example 7 

Figure 7 shows the alignment of mtClytin with clytin (Clytia gregaria) at the nucleic acid level. 
Example 8 

Figure 8 shows the alignment of mtClytin with clytin (Clytia gregaria) at the amino acid level. 
Example 9 

Kinetic analysis of mtClytin 

For the kinetic analysis of the bioluminescence of mtClytin, CHO cells were transiently transfected 
with pcDNA3-mtClytin or pcDNA-obelin or pcDNA3 (without any integrated cDNA). The 
transfection and measurement were carried out as described in Example 4. The readings were 
taken for a period of 60 seconds using an integration time of 1 second. 

Figures 5 and 6 show the results of the kinetic analysis of mtClytin and obelin. 
Example 10 
MITOPROT Analysis 

The computer program MITOPROT was used to analyze the mtClytin signal peptide (Claros et al., 
1996). The following photoproteins were analyzed: obelin (Q27709), aequorin (P07164), clytin 
(Q08121) and mtClytin (SEQ ID NO: 2). 
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Results of the analyses: 



Obelin: 

Sequence name : OBELIN 

Input sequence length : 195 aa 



VALUES OF COMPUTED PARAMETERS 
Net charge of query sequence : -11 

Analysed region : 11 

Number of basic residues in targeting sequence : 3 
Number of acidic residues in targeting sequence : 0 
Cleavagesite : not predictable 

Cleaved sequence : 



HYDROPHOBIC SCALE USED 





GES 


KD 


GVH1 


ECS 


H17 : 


-0 . 624 


0.259 


-0.308 


0.295 


MesoH : 


-1 . 573 


-0 .241 


-0 . 642 


0 . 060 


MuHd_07 5 : 


14 . 019 


3 . 641 


4 .408 


1.523 


MuHd_0 9 5 : 


7 .994 


7 .898 


3 .285 


1 . 838 


MuHd_l 0 0 : 


13 .734 


9 .836 


5 . 597 


2 .742 


MuHd_105 : 


21 . 195 


11.755 


7.339 


4 . 117 


Hmax_07 5 : 


-9 .450 


-2 .800 


-4 . 008 


1. 132 


Hmax_095 : 


-0 . 963 


1.837 


-1.971 


1 . 103 


Hmax_100 : 


0 .400 


1.300 


-1. 942 


2 .240 


Hmax_105 : 


10 . 617 


6.067 


0.733 


3 . 127 



PROBABILITY 

of export to mitochondria: 0.1479 
Aequorin : 

Sequence name: AEQUORIN 

Input sequence length : 196 aa 



VALUES OF COMPUTED PARAMETERS 

Net charge of query sequence : -13 

Analysed region : 3 

Number of basic residues in targeting sequence : 0 

Number of acidic residues in targeting sequence : 0 
Cleavage site : not predictable 
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Cleaved sequence : 







HYDROPHOBIC 


SCALE USED 






GES 


KD 


GVH1 


ECS 


H17 : 


0 . 006 


0 .794 


-0.263 


0 .368 


MesoH : 


-1. 673 


-0.382 


-0.703 


0 .048 


MuHd_075 : 


24.326 


4.153 


5.947 


2 .450 


MuHd_0 95 : 


12 . 638 


7 .213 


4.218 


1 .796 


MuHd_l 0 0 : 


13 .748 


8.827 


4 .477 


2 .427 


MuHd__l 0 5 : 


16.581 


11 .426 


5.056 


3 .453 


Hmax_075 : 


0 .438 


0.233 


-2.490 


1 .692 


Hmax_0 9 5 : 


0.525 


-1.400 


-2 .394 


0 . 674 


Hmax_l 0 0 : 


-0 . 100 


-1 .200 


-2 .292 


1 .550 


Hmax_l 0 5 : 


0 . 500 


-0 . 000 


-2 .164 


1 .540 



PROBABILITY 
of export to mitochondria: 0.0148 
Clytin : 

Sequence name: CLYTIN 

Input sequence length : 198 aa 



VALUES OF COMPUTED PARAMETERS 
Net charge of query sequence : -9 

Analysed region : 32 

Number of basic residues in targeting sequence : 6 
Number of acidic residues in targeting sequence : 2 
Cleavage site : not predictable 

Cleaved sequence : - 



HYDROPHOBIC SCALE USED 



H17 
MesoH 
MuHd_075 
MuHd_0 9 5 
MuHd_l 0 0 
MuHd_l 0 5 
Hmax_07 5 
Hmax 095 



GES 


KD 


GVH1 


ECS 


-0.429 


0.341 


-0.313 


0.313 


-1.778 


-0 .307 


-0.718 


0.053 


32 . 928 


17.509 


7 . 351 


5. 708 


30.874 


20 .344 


9.074 


5.834 


36 . 596 


22 . 666 


10.051 


6.762 


39 .174 


19 .336 


10 . 379 


7 . 609 


4.900 


7 . 087 


-1.223 


3.684 


13 . 600 


10.100 


1.251 


4.390 
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Hmax_100 : 14.000 12.600 1.601 5.060 

Hmax_105 : 6.650 13.067 -0.468 3.920 

PROBABILITY 

of export to mitochondria: 0.2047 
clytin-2 : 

Sequence name: CLYTIN-2 

Input sequence length : 19 8 aa 





VALUES OF COMPUTED PARAMETERS 






Net charge of query sequence 






-7 


Analysed region 










16 


Number of basic 


residues 


in targeting 


sequence 




3 


Number of acidic 


residues 


in targeting 


sequence 




1 


Cleavage site 


: not 


predictable 








Cleaved sequence 


• 












HYDROPHOBIC SCALE USED 








GES 


KD 


GVH1 




ECS 


H17 : 


-0 .288 


0.341 


-0 .213 


0 


.313 


MesoH : 


-1 . 519 


-0.206 


-0 . 681 


0 


. 081 


MuHd_07 5 : 


32 . 594 


15.092 


8 . 192 


4 


. 075 


MuHd_095 : 


36.090 


19.707 


8.836 


6 


.716 


MuHd_l 0 0 : 


38 . 617 


20 .269 


9 . 682 


6 


.851 


MuHd_l 0 5 : 


30 .267 


16.082 


8.229 


5 


. 470 


Hmax_07 5 : 


6 . 533 


6.417 


-0.793 


2 


. 508 


Hmax_0 9 5 : 


13 . 600 


10 . 100 


1.251 


4 


.390 


Hmax_l 0 0 : 


13 . 600 


10 . 100 


1.251 


4 


.390 


Hmax_l 0 5 : 


13 .417 


10 . 150 


1 . 612 


3 


. 862 



PROBABILITY 

of export to mitochondria: 0.3 974 
mtClvtin : 

Sequence name: mtClytin 

Input sequence length : 22 8 aa 



VALUES OF COMPUTED PARAMETERS 
Net charge of query sequence 
Analysed region 



-8 
34 
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Number of basic 


residues 


in targeting 


sequence 


• 
• 


6 


Number or acidic 


residues 


in targeting 


sequence 


• 
• 


0 


Cleavage site 




: 17 








Cleaved sequence 




: MQRFTNRLLSMSALRA 






• 




HYDROPHOBIC SCALE USED 








GES 


KD 


GVH1 




ECS 


H17 : 


-0 . 135 


0.453 


-0 . 343 


0 


.309 


MesoH : 


-1 . 623 


-0.215 


-0.701 


0 


. 073 


MuHd_075 


33 .394 


19 . 322 


8 . 634 


7 


. 593 


MuHd_0 9 5 : 


34 . 726 


19 . 634 


8. 110 


8 


. 861 


MuHd__100 : 


32 . 825 


16. 596 


7 .376 


7 


. 520 


MuHd_l 0 5 


28 . 005 


19.893 


7.410 


7 


.865 


Hmax_075 : 


16. 683 


17.733 


2 . 851 


5 


.763 


Hmax_0 9 5 : 


13 .125 


13.388 


2 .299 


4 


.314 


Hmax_100 : 


8.300 


11 . 500 


1.845 


3 


. 830 


Hmax_l 0 5 : 


1.700 


9.500 


-1.171 


2 


.390 



PROBABILITY 
of export to mitochondria: 0.9974 

The probability of a translocation of the analyzed peptide into mitochondria increases as the 
calculated factor approaches 1 . 

The analysis of the protein sequences of obelin, aequorin, clytin, clytin-2 and mtClytin has shown 
that only mtClytin has the features of a protein which can be transported into mitochondria. 

Example 11 

Figure 9 shows the alignment of mtClytin, clytin (Clytia gregaria) and clytin-type2 at the amino 
acid level. 
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